49 research outputs found
Numerically Stable Recurrence Relations for the Communication Hiding Pipelined Conjugate Gradient Method
Pipelined Krylov subspace methods (also referred to as communication-hiding
methods) have been proposed in the literature as a scalable alternative to
classic Krylov subspace algorithms for iteratively computing the solution to a
large linear system in parallel. For symmetric and positive definite system
matrices the pipelined Conjugate Gradient method outperforms its classic
Conjugate Gradient counterpart on large scale distributed memory hardware by
overlapping global communication with essential computations like the
matrix-vector product, thus hiding global communication. A well-known drawback
of the pipelining technique is the (possibly significant) loss of numerical
stability. In this work a numerically stable variant of the pipelined Conjugate
Gradient algorithm is presented that avoids the propagation of local rounding
errors in the finite precision recurrence relations that construct the Krylov
subspace basis. The multi-term recurrence relation for the basis vector is
replaced by two-term recurrences, improving stability without increasing the
overall computational cost of the algorithm. The proposed modification ensures
that the pipelined Conjugate Gradient method is able to attain a highly
accurate solution independently of the pipeline length. Numerical experiments
demonstrate a combination of excellent parallel performance and improved
maximal attainable accuracy for the new pipelined Conjugate Gradient algorithm.
This work thus resolves one of the major practical restrictions for the
useability of pipelined Krylov subspace methods.Comment: 15 pages, 5 figures, 1 table, 2 algorithm
A new level-dependent coarsegrid correction scheme for indefinite Helmholtz problems
In this paper we construct and analyse a level-dependent coarsegrid
correction scheme for indefinite Helmholtz problems. This adapted multigrid
method is capable of solving the Helmholtz equation on the finest grid using a
series of multigrid cycles with a grid-dependent complex shift, leading to a
stable correction scheme on all levels. It is rigourously shown that the
adaptation of the complex shift throughout the multigrid cycle maintains the
functionality of the two-grid correction scheme, as no smooth modes are
amplified in or added to the error. In addition, a sufficiently smoothing
relaxation scheme should be applied to ensure damping of the oscillatory error
components. Numerical experiments on various benchmark problems show the method
to be competitive with or even outperform the current state-of-the-art
multigrid-preconditioned Krylov methods, like e.g. CSL-preconditioned GMRES or
BiCGStab.Comment: 21 page
A fast and robust computational method for the ionization cross sections of the driven Schroedinger equation using an O(N) multigrid-based scheme
This paper improves the convergence and robustness of a multigrid-based
solver for the cross sections of the driven Schroedinger equation. Adding an
Coupled Channel Correction Step (CCCS) after each multigrid (MG) V-cycle
efficiently removes the errors that remain after the V-cycle sweep. The
combined iterative solution scheme (MG-CCCS) is shown to feature significantly
improved convergence rates over the classical MG method at energies where bound
states dominate the solution, resulting in a fast and scalable solution method
for the complex-valued Schroedinger break-up problem for any energy regime. The
proposed solver displays optimal scaling; a solution is found in a time that is
linear in the number of unknowns. The method is validated on a 2D Temkin-Poet
model problem, and convergence results both as a solver and preconditioner are
provided to support the O(N) scalability of the method. This paper extends the
applicability of the complex contour approach for far field map computation [S.
Cools, B. Reps, W. Vanroose, An Efficient Multigrid Calculation of the Far
Field Map for Helmholtz and Schroedinger Equations, SIAM J. Sci. Comp. 36(3)
B367--B395, 2014].Comment: 24 pages, 10 figures, 1 tabl
Analyzing the effect of local rounding error propagation on the maximal attainable accuracy of the pipelined Conjugate Gradient method
Pipelined Krylov subspace methods typically offer improved strong scaling on
parallel HPC hardware compared to standard Krylov subspace methods for large
and sparse linear systems. In pipelined methods the traditional synchronization
bottleneck is mitigated by overlapping time-consuming global communications
with useful computations. However, to achieve this communication hiding
strategy, pipelined methods introduce additional recurrence relations for a
number of auxiliary variables that are required to update the approximate
solution. This paper aims at studying the influence of local rounding errors
that are introduced by the additional recurrences in the pipelined Conjugate
Gradient method. Specifically, we analyze the impact of local round-off effects
on the attainable accuracy of the pipelined CG algorithm and compare to the
traditional CG method. Furthermore, we estimate the gap between the true
residual and the recursively computed residual used in the algorithm. Based on
this estimate we suggest an automated residual replacement strategy to reduce
the loss of attainable accuracy on the final iterative solution. The resulting
pipelined CG method with residual replacement improves the maximal attainable
accuracy of pipelined CG, while maintaining the efficient parallel performance
of the pipelined method. This conclusion is substantiated by numerical results
for a variety of benchmark problems.Comment: 26 pages, 6 figures, 2 tables, 4 algorithm
A multi-level preconditioned Krylov method for the efficient solution of algebraic tomographic reconstruction problems
Classical iterative methods for tomographic reconstruction include the class
of Algebraic Reconstruction Techniques (ART). Convergence of these stationary
linear iterative methods is however notably slow. In this paper we propose the
use of Krylov solvers for tomographic linear inversion problems. These advanced
iterative methods feature fast convergence at the expense of a higher
computational cost per iteration, causing them to be generally uncompetitive
without the inclusion of a suitable preconditioner. Combining elements from
standard multigrid (MG) solvers and the theory of wavelets, a novel
wavelet-based multi-level (WMG) preconditioner is introduced, which is shown to
significantly speed-up Krylov convergence. The performance of the
WMG-preconditioned Krylov method is analyzed through a spectral analysis, and
the approach is compared to existing methods like the classical Simultaneous
Iterative Reconstruction Technique (SIRT) and unpreconditioned Krylov methods
on a 2D tomographic benchmark problem. Numerical experiments are promising,
showing the method to be competitive with the classical Algebraic
Reconstruction Techniques in terms of convergence speed and overall performance
(CPU time) as well as precision of the reconstruction.Comment: Journal of Computational and Applied Mathematics (2014), 26 pages, 13
figures, 3 table